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Description 

Bacteria, in common with mammalian and other eucaryotic cells, possess a proteolytic system for the 
selective degradation of abnormal or no longer needed polypeptides, in the natural course of events, 
abnormalities may arise by either mutation or biosynthetic error. Alternatively, a normal polypeptide may 
be needed in the cell only at a specific time to carry out a specific vital function while, at any other time, the 
presence of that polypeptide is deleterious to the cell, so that, once the polypeptide has earned out its 
function it is selectively destroyed. These operations imply a complex network of interacting regulatory 
and structural elements, and, indeed, in Escherichia coll, the best studied bacterial system, eight different 
endoproteases have so far been described which function, individually or in concert, in this selective 

degradation system. r t . _ . . 4 . . . 

The introduction of foreign genes into a microorganism often has inadvertently activated this 
proteolytic system, thereby inhibiting the large scale production of a desired gene product by the practice 
of genetic engineering. This problem is particularly acute when genes coding for eucaryotic peptides are 
introduced into the bacteria. The strategy for circumventing this problem has been to fuse the nucleotide 
sequence coding for the desired gene product to at least part of a structural gene, or carrier protein gene, 
whose product is known to be constitutively stable. 

The tandem arrangement of desired gene product and carrier protein results in the expression from the 
organism of a hybrid polypeptide which is stable, provided that the carrier protein is large enough to 
identify the entire hybrid polypeptide as both native and constitutively stable. The prior art difficulty of 
direct expression of eucaryotic polypeptides and the necessity to use a carrier protein to achieve 
expression is well recognized and has been described in a number of scientific papers and published patent 
applications and patents, for example, in U.S. Patent No. 4,342,832. A typical patent which describes the 
use of this technique, for the production of somatostatin, is U.S. Patent No. 4,366,246. 

A disadvantage to this approach, however, is that the desired product constitutes only a small portion 
of the hybrid polypeptide, thereby decreasing significantly the theoretical yield, and necessitating further 
processing to split off the native carrier protein and to purify the desired polypeptide product. Efforts to 
decrease the native carrier moiety to a small portion of the hybrid product in an attempt to improve the 
yield, however, render the expressed hybrid polypeptide unstable. 

The symbols, abbreviations and some of the terms used herein have the following meanings: 

DNA — deoxyribonucleic acid 

A— Adenine 

T— Thymine 

G — Guanine 

C — Cytosine 

Tris— 2-Amino-2-hydroxy-ethyM ,3-propane diol 

EDTA — Ethylene diamine tetraacetic acid 

ATP— Adenosine triphosphate 

TTP— Thymidine triphosphate 

GTP — Guanosine triphosphate 

T4 — Bacteriophage specific to £ colt 

Nucleotide— Nucleoside phosphate 

Codon— Three nucleotide sequences specifying a particular amino acid 

Oligonucleotide— Polymer consisting of a number of nucleotides 

Plasmid— Self-replicating, extrachromosomai genetic element 
In accordance with the present invention, it has now surprisingly been found that this prior art problem 
can be satisfactorily overcome and the natural, selective proteolytic mechanisms of the microorganisms 
can be circumvented to achieve high yield production of a desired eucaryotic polypeptide. This is achieved 
by tandemly-linking a plurality of polypeptide genes through selectively cleavable ammo acid sequences 
incorporating the resulting DNA construct into a cloning vehicle, and transforming a microorganism with 

^n°Srda^ the present invention, therefore, there is provided a DNA fragment composed of 
repeating units coding for a desired product of known amino acid sequence tandemly-linked by 
oligonucleotides coding for a selectively cleavable amino acid sequence and having no codons for 
operative combinations of stop and start signals between the ends of said fragment, whereby the fragment 
is expressible in a suitable vehicle to produce a polypeptide composed of tandemly-linked repeating units 

of the said desired product. j — * i. 

By providing a DNA fragment composed of repeating units coding for a desired product of known 
amino acid sequence and separated by oligonucleotides coding for selectively cleavable amino acid units, 
a replicable cloning vehicle can be provided which is capable of stably expressing a product in a microbial 
organism. Once the product has been expressed during growth of the microorganism, it may be isolated 
from the cell culture, and the desired polypeptide product is obtained from the isolated product by cleaving 
the repeating units at the cleaving sites to form a plurality of individual units. Since multiple copies of the 
desired polypeptide product are expressed from the microorganism, the yield of product is considerably 
; enhanced with respect to the prior art expression procedure discussed above. 
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The present invention has genera, applicabil^ the ^^^^J^^^S^ 

growth hormone, somatostatin and human interferon Sam ent which contains at least two units 
Essential to the present invention ,s the formation of a ^J^^afed by a joining unit whose 
each coding for a desired product of known a™" 0 *^^^ 

SoSng -ent used for Ceaving also 

a half-life of sufficient duration to enable extraction thereof to J* J™* 60 - p 

invention products having a half-life 8"^' * 8n "J" ™Z session of human proinsulin, i.e. the 
^^^^ £?Mi^i A chains, but the princip.es thereof are 
widely applicable to any polypeptide, as discussed above . drawings, in which: 

S^K^^ accordance with this 



,nV6 F gure 2 is a derivation and restriction site map o p asm d plac 
Figure 3 is a derivation and restriction site map rrf ptamnd pTac API. 
Figure 4 illustrates the sequence of steps to form P |a ^<J PAT/PIR. 
Figure 5 is a derivation and restriction srte map of plasm.d PATJPW. f tion of | ac 239 . P , and 
Figure 6 illustrates the sequence of steps to form plasm.d plac 504/PI, and tormauon P 

n '^C^tTllerivation and restriction site map of plasmid plac : 504/PI; 

S^r^S^ ^^^^ on various products 

produced in accordance with this invention. nra duction of human proinsulin in accordance 

In the utilization of the present invention , to effect the P r °° u «'° n ° T _, a j n|no £ p , ura | it y of sequences 
o with the preferred embodiment hereof a DMA fragmen » "J se , e 3vely-cleavable 

,s of expressing the fragment in a microbial or 9 an ^ m ; _ roinsuMn gen e and inserting the same into a 
The preferred method of constructing the niult.ple P™nsuiir ' B™« « 

cloning vehicle, in both a fused and an "" f " s ^ 

configuration, the human pro.nsul.n coding W^J*WJ*™1* p a | actos ^ ase . In the unfused 

K i SSSJSSSS -ter in the case of the fused 

configuration, as compared with the "^ h ^^°£ nn is too unsta ble to be isolated as a product 
In both configurations, the single i gene ^"^^^^b sequences are added to the genetic 

, 5 from the microorganism but, as additional human ' Promsul n eoang p ^, vp eptide becomes increasingly 
construct by the method of Figure X the '^^ff^S^X^ "> the unfused 

stable, approaching normal stability -at J"^,^^^ £ fuse d configuration. As increasing 
configuration, and at two tandemly-l.nked p remsuhn domains _> ™ b f codi sequen ces is 

numbers of proinsulin coding sequences are tandemly linked ^J'^i, tne number of tandemly- 

so reached, beyond which the yield of po ymer decl.nes. Fo .example^ . ncreas g of 
ffnked u'nits in the unfused °°"*9^^^^ coding sequences in 

polymeric product being produced by the ce II. J » pw»r ^uerieaafn the fused configuration. 

tSSSSrCSS^ in the embodiment illustrated in 

55 Figure 1 is represented as follows: 

Proinsulin gene-Arg Arg Asn Ser Met-Proinsulin gene 

AGG CGC AAT TCT ATG 

60 The amino acid domain junction is easily 

proinsulin peptides. This domain junction i an Irtt .manner ^of cleaving 

linking units and modes of cleaving w ^!* m ^ o a n i°^ r a oup cyanogen bromide cleavage may be used 
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junction while for those polypeptides which do contain the methionine group, the design of a particular 
amino acid sequence positioned between desired polypeptide segments to provide an eas.ly-c eavable i site 
mTbe effected by genetic engineering methods. For example, polypept.des wh.cn do not contain an 
aspartyl-proline bond may be engineered to contain such a bond between each un.t of the multi-coding 
s sequence construct, so that recovery of the native material from the isolated multi-doma.n polypeptide 
may be achieved foliowing^leavage by acids, as described by Jauregui-Adell and Marti, Anal. Biochem. 69, 

468 (1975). -r 
In general, enzymatic methods or chemical reactions which are amino acid residue-sequence specific 

mav be used to effect cleavage of linking groups between the desired polypeptide genes. 

10 The authentic human proinsulin molecule contains no methionine residues, so that the cyanogen 
bromide treatment cleaves a multi-domain proinsulin molecule into several proinsulin analog moieties 
along with the C-termina! authentic proinsulin unit. This results from the designed placement of 
methionine codons at the 5'-end of each of the proinsulin coding sequence units. 

The individual proinsulin units derived in this manner may be further processed into authentic human 

is insulin by digestion with trypsin and carboxypeptidase B, or by any other convenient procedure. 

The formation of a recombinant plasmid containing an oligonucleotide sequence coding for multiple 
copies of human proinsulin leads to a considerable increase in yield of proinsulin and proinsulin analogs, 
when compared with the expression in the prior art of the hybrid molecule containing the p-galactosidase 

20 As seen in Figure 1 , the preferred sequence to form the novel plasmids pTac/2P\ and plac 239/2PI, and 
other multiple PI unit plasmids especially the expression plasmids p7ac/5PI and plac 239/3PI which is 
further engineered as illustrated in Figure 8 to form the preferred expression plasmid plac-3Pl, starts from 
the plasmid pAT/PIR which contains an oligonucleotide sequence between the EcoRI and BamHI restriction 
sites coding for human proinsulin. The derivation from the known plasmids pBCA4 of the plasmid pAT/PIR, 

25 which is also a novel plasmid, is illustrated in Figure 4. The pTac and plac 239 fragments are derived from 
plasmids pTacfPl and plac 239/PI respectively, which in turn are both derived from another novel plasmid. 
plac 504/PI, as shown in Figure 6. The derivation of plasmid plac 504/PI from known plasmids is also shown 

'it is not essential to obtain the p7ac and plac vectors from the plasmids p7ac/PI and plac/PI i.e. 

30 plasmids containing a single copy of the proinsulin gene. They may be derived from any other convenient 
plasmid which contains the 7ac and lac promotor respectively, and which can be cleaved at EcoRI and Bam 
HI sites to form the vector for ligation with the multiple proinsulin gene fragment. Similarly, it is not 
essential to form the plasmid plac 504/PI, i.e. also containing the single copy of the proinsu in gene, as an 
intermediate, but any other convenient intermediate plasmid may be employed. The particular single copy 

35 proinsulin gene plasmids here embodied were formed by the inventors during the experimentation which 
led to this invention in an attempt to provide constructs from which proinsulin would be stably expressed. 
As noted earlier, this was not found to be possible with the single copy gene. . 

Specific experimental details for the formation of the various plasmids set forth in the drawings and 
disclosed above are discussed in the Examples below. t.-jedi ^ti 

ao Detailed derivations and major restriction site maps for the novel plasmids plac 239-3PI. p7ac/5PI, pAT7 
PIR and plac 504/PI appear as Figures 2, 3. 5 and 7 respectively. £. coli JM103 modified by each of the 
plasmids plac 239-3PI and P 7ac/5PI, have been deposited on July 6, 1984 with the American Type Culture 
Collection located at Rockville, Maryland. U.S.A.. and have been accorded the follow.ng accession Nos.: 
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Organism £ co//7plasmid plac 239-3PI— ATCC 39760 

Organism £ co///plasmid P 7ac/5P1-ATCC 39759. 

The invention is illustrated further by the following Examples. 

^ This Example describes the formation of novel plasmids containing multiple units of the human 
proinsulin gene. 

(al Tf^SdLA™ was prepared following the sequence outlined in Figure 4 and the restriction map 
The plasmid pAT/PIR was P re P ar 5°J°"^'"%^ contains a nucleotide sequence coding for 

for the plasmid appears as Figure ^5 The^ startng How*. If the EcoRI-BamHI fragment of 

human proinsulin between its EcoRI and BamHI restriction is tes . " ow ' b modified as described 

this plasmid is used directly, the ^^'J^S^^SKJSt obtained from Professor 
below. Plasmid pBCA4 is described in Narang et al, bene, 7/, z/t) t iao*i 

R. Wu. an author of that paper of Cornell 1 ^'versity^ Ithaca J.Y,WA^ ^ 
The plasmid pBCA4 was cut with F °" ^J^ ^Sd^mHI fragment to extend the 5'-end of 
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synthetic nucleotide sequences were generated chemically and comprised the sequences: 

5' AATTCTATGTTTGTCAAT 3' 
3' GATACAAACAGTTAGTGG 5' 

The starting plasmid pAT153 is a high copy variant of the well-known plasmid pBR322 with the so- 
called "poison" sequences removed and is described by Twigg et al. Nature, 283, 216 (1980). The plasmid 
PAT153 was obtained from Professor O. Smithies, Dept. of Genetics, University of Wisconsin— Madison, 
Madison, Wisconsin, U.S.A. The plasmid pAT153 was cut with EcoRI and BamHI and the vector isolated. 
The isolated vector was ligated to the EcoRI-BamHl (PIR) fragment, to form the new plasmid pAT/PIR. 

(b) Preparation of multiple copies of proinsulin gene b . . 

The sequence of operations required to form multiple copies of the proinsulin gene is outlined in 
Figure 1. Synthetic oligonucleotides (A) and (B) were prepared chemically to provide the following 
sequences: 

(A) 5' CCTCTACCAGCTGGAGAACTACTGCAACAGGCGC 3' 

(B) 3' ATGGTCGACCTCTTGATGACGTTGTCCGCGTTAA 5' 

200 p.mo!es of oligonucleotide B and 200 p.moles of oligonucleotide A having its 5'-end labelled with 
32 P-ATP were mixed in 30 ul of 66 mM Tris-HCi (pH 7.6), 6.6 mM MgCl 2 and 10 mM dithiothreitol, and 
heated at 80°Cfor 15 minutes to anneal the oligonucleotides to form a part of the human proinsulin coding 
sequence beginning at the SfaNI cut site near the 3'-end of the coding sequence and terminating with the 
last proinsulin codon (AAC), followed by the additional sequences: 

_ 5' AGGCGC 3' 

30 3' TCCGCGTTAA 5' 

These additional sequences of the synthetic oligonucleotides encode two arginine residues end 
provides a sticky end for ligation to an EcoRI sticky end. The ligation destroys the EcoRI site because of the 

CG base pair (boxed in Figure 1). . 4U An . 

35 10 ug of pAT/PIR, prepared as described in (a) above, was digssted for 1 hour at 37 C with 40 units each 
of EcoRI and BamHI in 40 pi of digestion buffer. The digested DNA was dephosphorylated by digestion at 
37°C for 30 minutes with 4 units of calf intestine alkaline phosphatase m 200 mM Tns-HCI, pH 8.0. 1 he 
proinsulin gene fragment (PIR) was isolated from 1% agarose gel by phenol extraction. The isolated 
fragment was further digested for 3 hours at 37X with 6 units of SfaNI in 40 pi of digestion buffer and the 

40 EcoRI-SfaNI fragment of PIR was isolated. t . 

100 p.moles of the annealed oligonucleotides (A) and (B) and approximately 4 p.moles of the isolated 
PIR EcoRI-SfaNI fragment were ligated for 14 hours at 15X in 30 pi of 66 mM Tris-HCI (pH 7.6), 6.6 mM 
MgCU, 10 mM dithiothreitol and 1 mM ATP using 20 units of T4 DNA ligase As a result of 
dephosphorlyation of the proinsulin gene fragment prior to SfaNI cleavage, and phosphorlyation of only 

4s dlioonucieotide (A), as described above, the reaction was unidirectional, resulting in ligation of the 
proinsulin gene fragment PIR to the annealed oligonucleotide. The ligation reaction product was 
electrophoresed on 1.5% agarose gel and the resultant proinsulin gene analog construct (PI Analog) was 

isolated from the gel. , . . . . . . ... 

The isolated proinsulin gene analog sequence (approximately 3 p.moles) was phosphorylated with 

so ATP and T4 polynucleotide kinase. 10 ug of plasmid pAT/PIR, prepared as described in (a) above, was 
dioested for 1 hour at 37°C with 40 units each of EcoRI and BamHI. Approximately 3 p.moles of the PIR 
proinsulin gene sequence were isolated from agarose gel. The isolated proinsulin gene sequence and 
phosphorlyated proinsulin gene analog sequence were ligated at 37°C for 3 hours in 30 pi of ligation buffer 
containing 20 units of T4 ligase, 40 units of EcoRI and BamHI. # 

« The ligation mixture was run on 1.0% agarose gel and, after autoradiography, the dimer, i.e. two 
proinsulin coding sequences, and trimer, i.e. three proinsulin coding sequences, were isolated from the gel 
by phenol extraction. The tetramer, pentamer and hexamer, i.e. four, five and six proinsulin coding 
sequences respectively, were obtained by further ligating the isolated fragments with additional proinsulin 
gene analogs following the general procedure described above. 

60 (c) Preparation of pTac and plac 239 recombinants containing multiple PI units 

The various PI fragment constructs, containing from two proinsulin codhng sequences up to six 
proinsulin coding sequences, were inserted into the vectors plac and plac 239 by Ration as generally 
described in (b) above. The pTac and plac 239 vectors were formed by digestion with EcoRI and BamHI of 

65 pTacl?\ and plac 239/PI respectively, which, in turn, were formed following the procedures which are 
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outlined below in sections (d) and (e). By inserting a proinsulin gene trimer into the plac 239 vector anda 
proinsulin gene pentamer into the p7ac vector, there were obtained the fused and unfused multiple 
proinsulin gene plasmids plac 239/3PI and p7ac/5PI respectively. The derivation and main restriction site 
map for the latter new plasmid appears as Figure 3. 
5 In similar manner, other multiple proinsulin gene plasmids, including plac 239/4PI, plac 239/5PI, p7ac/ 
3PI, p7ac/4PI, and p7ac/6PI were constructed from the various multimer proinsulin genes and the plac 239 
and p7ac vectors. The various constructs, containing from 1 to 6 copies of the proinsulin coding sequence, 
were confirmed by restriction mapping and DNA sequence analysis. 

10 (d) Preparation of plac 239-PI 

The derivation of plasmid plac 239-PI is illustrated in Figure 6 and was prepared from plasmid plac 504/ 
PI, as shown in detail in Figure 8. The formation of plasmid plac 504/PI is described in more detail below in 
section (f). The plasmid plac 504/PI has a lac promoter, the (3-galactosidase gene {z-gene) complete to the 
EcoRI site near the carboxy terminal of the gene with the human proinsulin gene insert of plasmid pBCA4 

15 fused at this EcoRI site, so as to remain in reading frame, such that the translation of the z-gene continues 
directly to code for proinsulin. 

Referring to Figure 8, plasmid plac 504/PI was digested with Mstll and the resulting sticky ends were 
filled in with dTTP and dGTP. The 5'-T's were then removed by digestion with mung bean nuclease, the 
fragment cut with EcoRI to remove the Mst il/EcoRI segment of the z-gene, the sticky ends of EcoRI were 

20 filled in with dATP and dTTP, and the plasmid was blunt-end ligated to establish a new EcoRi site, the 
resulting plasmid being designated as plac 239/PI. The plasmid was cut with EcoRI at the new site, the 
fragment EcoRI tails were digested with mung bean nuclease, and the resulting fragment blunt-end ligated 
to form the plasmid plac 239-PI in which the correct PI reading frame was restored. In order to restore the 
correct PI reading frame in other fused plasmids, for example, plac 239/3PI, the EcoRI tails should also be 

25 removed by the same procedure described above, to form the corresponding expression plasmid plac 239- 
3PI. The derivation and main restriction site map for plac-3PI appears as Figure 2. 

(e) Preparation of pTac/PI 

The preparation of plasmid pTac/PI is illustrated in Figure 6 and was effected from plac 504/PI and from 
30 plasmid pDR540. The plasmid pDR540 is described by Russell et al in Gene, Vol. 20, 231 (1982) and has the 
following features: a Hindlll site upstream from the promoter, a -35 region from top, the -10 operator, 
Shine-Delgarno, and Shine-Delgarno-to-start spacing regions of lac, with a BamHI site positioned at the 
start. The plasmid pDR540, purchased from P-L Biochemicals, Milwaukee, Wisconsin, U.S A, was cut with 
BamHi and Hindlll, the sticky ends removed with mung bean nuclease and the isolated 7ac promoter was 
35 inserted by blunt-end ligation into the isolated Hindlll-EcoRI fragment of plasmid plac 504/PI which contains 
the PI gene, by first digesting plasmid piac/PI with Hindlll and EcoRI and then filling in the ends with DNA 
polymerase I. In this way, the 7ac promoter of the plasmid pDR540 is substituted for the wild-type lac 
promoter and the z-gene portion of plac 504/PI, thereby preserving the sequence, spacing and properties of 
the hybrid promoter, but with an EcoRI sequence approriately located. 

40 

(f) Preparation of plac 504/PI 

Plasmid plac 504/PI was prepared following the sequence of steps set forth in Figure 6 and the 
derivation and main restriction sites for the plasmid are illustrated in Figure 7. 

The initial step in the formation of plasmid plac 504/PI was the formation of plasmid plac 19 from 
45 known plasmids pMC1396 and M13mp7. The plasmid pMC1396 is described by Casadaban et al in J. Bact, 
Vol. 143, 971, (1980) while the plasmid M13mp7 is described by Messing et al in Nucleic Acids Research, 
Vol. 9, 309 (1981). Plasmid M13mp7 was purchased from BRL Inc., Gaithersburg, MO, U.SA and plasmid 
pMC1396 was a gift of M. Casadaban of Cambridge, MA, U.SA 

Plasmid pMC1396 was digested with Smal, treated with Bal 31 and ligated with Hindlll linkers to form 
so plasmid plac 9. Plasmid pM13mp7 was digested with Avail and ligated to a Hindlll linker to form plasmid 
plac 1. Both plasmid plac 9 and plac 1 were digested with Hindlll and ligated to form the plac 19 plasmid. 

Plasmid pBCA4, containing the sequence coding for human proinsulin, was digested with EcoRI and 
Sail and the EcoRI— Sail fragment was isolated. Plasmid plac 19 was digested with EcoRI and Sail and the 
resulting vector was ligated with the EcoRI-Sall fragment of plasmid pBCA 4 to form plasmid plac 19/PI. 
55 Plasmid pBGP120 contains the lac promoter and is described in Polisky et al, Proc. Nat. Acad. Sci. Vol. 
73, 3900 (1976). This plasmid obtained from Dr. R. Wu of Cornell University, Ithaca, N.Y., U.SA, was 
digested with EcoRI and Hindlll and the Hindlll-EcoRI fragment containing the lac promoter was separated. 
Following digestion of the p!ac19/PI plasmid with EcoRI and Hindlll, the resulting vector was ligated with 
the Hindlll-EcoRI fragment of plasmic pBGP120 to obtain the plasmid plac 504/PI. 

60 

Example 2 

This Example illustrates the preparation of converted microorganisms from the novel plasmids 
containing multiple proinsulin genes and expression of the polyproinsulin polypeptide. 

65 
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(a) Transformation and orowth °'f!' ls , „ p7Btfm p,, where m | 8 3 to 6. were used to 
tran^^KXSrn'^O^ 

density of 0.1, measured at a wavelength of 560 nm. 

(b) Recovery of expressed polypeptide Q u rtI1 tAtoi? hours The cells were harvested by 

wavelength of 12 nm at 0°C. The sonicated material was c ^ tr ?"^ m$^Sl$£ 
microcentrifuge at 4°C. The centrifuged pellets were dissolved in the sample buffer and analyzed oy 
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showrfXre 9. Proi partMiy -^'^„Tji^^u^^ 
mB 'Si ra phore.,e an.h*s of "-P^,^ 

known to be typically obtained when expressing hybrid polypeptides in the prior an. 

EXa Thll Lample illustrates the isolation and purification of polypeptide fractions and the formation of 

insulin therefrom. 

(a) Recovery of proinsulin sonically disrupted at 0°C using 

About 7 g of cells were suspended in 1 2 ,000 g .for 10 minutes, the 

two 90 second bursts rt » ^'SHS,"', M NaCI 4 Murea ApHBA. using 5 ml/g of cells, resonicated and 
pellet was suspended in 25 mMTris-HCI, 2 MNiW « „Xnmin^ulin the c-ellet was resuspended in GDFT 
recentrifuged at 12.000 g. In order to s °' ub,l * e * e 

buffer (7 M guanidinium ehlorld. ,100 ^S^SS^X^Sffi * effluTon into f volumes of ice- 

(b) Production of insulin nrofluc t was sulfitolvzed by dissolving the material in GE buffer. 

Recovered cyanogen bromide^ d.ges J^JSlS Wfl 10 mg/ml, adding Na 2 S0 a and Na 2 S 4 0 6 
containing 6 M 0"an.d.nium ^ the reason to occur for 2 hours at room 

to concentrates of 0.4 M and ^ M nwp^iveiyji no an g mM ammonium acetate (pH 

temperature. The sulfitolyzed pro duct ™ B » P?"?™^ containing 25% formamide and 50 mM Tris-HCI 
ffffiBKatS^S»i" in the ratio of 200 mg per 2.5x10 cm 
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column. The products were differentially eluted by sodium chloride during application of a 0 to 500 mM 1 
linear NaCI gradient. 

The sulfitolyzed biosynthetic human proinsulin (BHPI-SS0 3 ) was readiiy identifiable by HPLC-analysis 
of the salt gradient elution profile. BHPI-SS0 3 was collected and dialyzed at pH 4.5 to remove formamide 

s and to precipitate the product The pure BHPI-SS0 3 precipitate was dissolved in RH buffer, containing 50 
mM glycine and 5 mM EDTA (pH 10.5), at 0.5 mg/ml and cooled to -2°C. Ethanethiol was added to 800 \im 
and allowed to react for 24 hours to effect refolding of the molecule. The reaction was stopped by adjusting 
the pH to 7 with HCI and the reaction product was analyzed. Refolded biosynthetic human proinsulin (BHPI) 
constituted the monomeric component of the reaction product. 

10 The BHPI was dissolved in TC buffer, containing 100 mM Tris-HCI (pH 7.6) and 10 mM CaCI 2 , at 2 mg/ 
ml, 10 pg/ml of trypsin and 40 pg/ml of carboxypeptidase B were added, and the mixture was allowed to 
react for 30 minutes at room temperature. The reaction was stopped by adding acetic acid to 1% and 
biosynthetic human insulin (BHI) was recovered by precipitation. The recovered yield was 98% of 
theoretical, based on the amount of BHPI used. 

15 The BHI was considered authentic on the basis of HPLC analysis, amino acid composition, amino acid 
sequencing and biological activity. Human proinsulin in the form of tandemly-linked units, therefore, is 
stably expressed from transformed cells and human insulin can be readily formed from the expressed 
product. 

In summary of this disclosure, the present invention provides a procedure for expressing polypeptides 
20 from organisms, which involves providing a DNA fragment containing a multiple number of 
oligonucleotide sequences each coding for a desired polypeptide product and tandemly-linked by 
oligonucleotides coding for easily-cleavable amino acid groupings, forming the fragment into a 
recombinant plasmid and transforming an organism by the plasmid. 

25 Claims 

1. A DNA fragment composed of repeating units coding for a desired product of known amino acid 
sequence tandemly-linked by oligonucleotides coding for a selectively cieavable amino acid sequence and 
having no codons for operative combinations of stop and start signals between the ends of said fragment, 

30 whereby the fragment is expressible in a suitable vehicle to produce a polypeptide composed of tandemly- 
linked repeating units of the said desired product. 

2. A fragment as claimed in Claim 1 in a repiicable cloning vehicle capable of expressing the fragment 

in a microbial organism. 

3. A fragment as claimed in Claim 1 or Claim 2, wherein said selectively-cleavable amino acid sequence 
35 includes an aspartyl-proline bond so as to be selectively-cleavable at that bond by acids. 

4. A fragment as claimed in Claim 1 or Claim 2 characterised in that the desired product is human 

proinsulin. . 

5. A fragment as claimed in any of Claims 1, 2 or 4, characterised by from 2 to 6 repeating units each 
coding for human proinsulin and being tandemly-linked through olignucleotides coding for selectively 

40 cieavable amino acid sequences. 

6. A fragment as claimed in any of Claims 1, 2, 4 or 5, wherein said selectively-cleavable amino acid 
sequence includes a methionine residue so as to be selectively cieavable at that residue by cyanogen 
bromide. 

7. A recombinant plasmid, comprising a plasmid vector and an expressible DNA fragment ligated 
45 thereto, characterised in that the DNA fragment is composed of repeating units coding for a desired 

product of known amino acid sequence tendemly-linked by oligonucleotides coding for a selectively 
cieavable amino acid sequence, and having no codons for operative combinations of stop and start signals 
between the ends of said fragment 

8. A plasmid as claimed in Claim 7 wherein said encoded selectively-cleavable ammo acid sequence 
so includes a methionine residue so as to be selectively cieavable at that residue by cyanogen bromide. 

9. A plasmid claimed in Claim 7 characterised in that the desired product is human proinsulin, the DNA 
fragment is joined to the 3'-side of a lac operon fragment encoding the first approximately 80 amino acids 
of p-galactosidase, and the fragment contains from 2 to 5 tandemly-linked oligonucleotide sequences 

coding for human proinsulin. 
55 10. A plasmid as claimed in Claim 7 characterised in that the desired product is human proinsulin, the 
DNA fragment is directly linked to a fragment of the cloning vector containing the Tac promoter followed 
by a bacterial Shine-Dalgarno sequence, and the fragment contains from 3 to 6 tandemly-linked 
oligonucleotide sequences coding for human proinsulin. 

11. A plasmid as claimed in any of Claims 7, 9 or 10 wherein said encoded selectively-cleavable ammo 
60 acid sequence includes a methionine residue so as to be selectively cieavable at that residue by cyanogen 

bromide. 

12. A microorganism which has been transformed by the plasmid claimed in any one of Claims 7 to 11 
for expression of said DNA fragment therefrom. 

13. A method of producing polypeptides by microorganisms, characterised by: 
55 (a) cleaving a vector at a predetermined restriction site; 
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lb) nreoarina an insert for the cleaved vector including a DNA fragment which has multiple copies of 
the aene S^epttde separated by oligonucleotide sequences coding for selectively claavabe 
amino acid seqSeS and which has no codons for operative combinafons of stop and start signals 

s » form a recombin ^ nt plasmid containing the insert: 

Id) transforming a microorganism with the recombinant plasmid. , nillI 

e gSng said microorganism in a culture medium to effect expression of a DNA fragment 
containina said multiple copies of the coding sequence for the polypeptide; 

(f) Mating ™m The microorganism a fraction constituting a polymer composed of the multiple l.nked 

10 conies of the polypeptide; and . 

7o» eleavino the isolated multiple copy polypeptide to single polypeptides, 
i?. A^KdnSlmMl in Claim 13, characterised in that step (f) includes some disrupt™ of cells of 
the microorganism separated from the culture medium. „i„ !mo H i„ riaim 19 characterised 

15 A method of producing polypeptides from a microorganism as claimed m i Claim 12, characterised 
,5 by or^?^ mTcroorganism in a culture medium to effect expression of a DNA fragment contam.ng 
sL mSle cop^s of the coding sequence for the polypeptide and to produce a polymer composed of 
multiple inked copies of the polypeptide; and cleaving the polymer to produce said polypept.de. 

?6 A metho! ! m claimed in any one of Claims 13 to 15 characterised in that the polypeptide is human 

20 Pr0i 17 U A° method as claimed in Claim 16, characterised in that the human proinsulin is converted into 
human insulin. 

Patentanspruche 

25 1 DNA-Fragment, das aus wiederholten Einheiten, die fur ein gewunschtes Produkt mit bekannter 
Aminosa^esSuSodieren und Tandem-artig durch Oligonukleotide verbunden smd, zusammengesetzt 
^ wobefdie O^gSnukleotide fOr eine selektiv spaltbare Aminosauresequenz kodieren p Und Ikeme Kodons 
«r' ^newirksame Kombination von Stopp- und Startsignalen zwischen den Enden des Fragmentes haben, 
wooS fdas Sgmem in e ?nem geeigneten Vehikel exprimierbar ist, urn ein Polypepfd zu erzeugen das 
30 ^Tn^riv!$v*to™n wiederholten Einheiten des gewOnschten Produktes zusammengesem ist 
30 ^ n ^% w n l ch Ansprucn t in einem replizierbaren Klonierungsvehikel. das zur Express.on des 

Fra TTa g mem Tac^ '^nSTo^'ffiu'S, gekennzeichnet, daB die selektiv spaltbare 
Amln^K Jenz ^eineXpar^Proiinbindung einsch.ieBt. urn an dieser Bindung durch Sauren se.ekfv 

35 SPa, 4 b Fragmen n t nach Anspruch 1 Oder 2, dadurch gekennzeichnet, daB das gewunschte Produkt humanes 

Pr0i 5 S "Fragment nach einem der Anspruche 1, 2 oder 4. gekennzeichnet durch zwei bis sechs w^rholte 
Einheiten S jeweils fur humanes Proinsulin kodieren und Tandem-art.g durch Ohgonukleot.de 

spaltbare Aminosiuresequenz einen Methioninrest umfaBt, urn so an d.esem Rest selekt.v durch 
Br0 ^Tkom a b!Santes S pL n smid^ umfassend einen P.asmidvektor und ein damit ligiertes exprimierbares 
* DN ^St U t ^^^^^^^^^ 

„ ^l^pSm^ch TSST^SS***. daB die kodierte se.ektiv spa.tbare 

50 AmiSosSequ e™ einer Eionin'rest umtaft. urn so an diesem Rest se.ektiv durch Bromcyan spaltbar 

2U S 9 in Plasmid nach Anspruch 7, dadurch gekennzeichnet. daB das gewunschte Produkt humanes 
PmiLnn iS das DNA Fragment mit dem 3'-Ende eines Lac-Operon-Fragmentes verbunden ist. das d e 
ss S ^ZS^ZSLm der B-Ga.actosidase kodiert. und daB das Fragment 2-5 Tandem-artig 

-rk^X^^ ^-ttft Produkt humanes 

spaltbare Aminosauresequenz einen Methioninrest umtalit, urn so «jh 

^SKo^it'd- zur Expression des DNA-Fragmentes mit dem Plasmid nach einem der 
6S Anspruche 7—11 transformiert ist. 
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13. Verfahren zum Herstellung von Polypeptiden durch Mikroorganismen, gekennzeichnet durch: 

(a) Spalten eines Vektors an einer vorbestimmten Restriktionsschnittstelle; 

(b) Herstellen eines inserts fur den gespaltenen Vektor einschlielSlich eines DNA-Fragmentes, das 
multiple Kopien des Genes fur das Polypeptid hat, die durch Oligonukleotidsequenzen getrennt sind, die 

5 fur selektiv spaltbare Aminosauresquenzen kodieren, und keine Kodons fur die wirksame Verbindung von 
Stopp- und Startsignalen zwischen den Enden des Fragmentes aufweist; 

(c) Ligieren des Inserts mit dem gespaltenen Vektor, urn ein das Insert enthaltendes rekombinantes 
Plasmid zu bilden; 

(d) Transformieren eines Mikroorganismus mit dem rekombinanten Plasmid; 

io (e) Wachsenlassen des Mikroorganismus in einem Kulturmedium, urn die Expression eines DNA- 
Fragmentes, das die muitiplen Kopien der kodierenden Sequenz fur das Polypeptid enthalt, zu bewirken; 

(f) Isolieren einer Fraktion aus dem Mikroorganismus, die das Polymer konstitutiert und aus muitiplen 
verbundenen Kopien des Polypeptides zusammengesetzt ist; und 

(g) Spalten des isolierten Polypeptids mit muitiplen Kopien zu einzelnen Polypeptiden. 

is 14. Verfahren nach Anspruch 13, dadurch gekennzeichnet, daS der Schritt (f) ein Aufbrechen der Zellen 
durch Beschallen des aus dem Kulturmedium abgetrennten Mikroorganismus umfafct. 

15. Verfahren zum Herstellen von Polypeptiden aus einem Mikroorganismus nach Anspruch 12, 
gekennzeichnet durch Wachsenlassen des Mikroorganismus in einem Kulturmedium, urn die Expression 
eines mehrere Kopien der fur das Polypeptid kodierenden Sequenz enthaitenden DNA-Fragmentes zu 

20 bewirken und ein Polymer zu erzeugen, das aus muitiplen verbundenen Kopien des Polypeptids besteht; 
und Spalten des Polymers, urn das Polypeptid zu erzeugen. 

16. Verfahren nach einem der Anspruche 13 bis 15, dadurch gekennzeichnet, dafc das Polypeptid 
humanes Proinsulin ist. 

17. Verfahren nach Anspruch 16, dadurch gekennzeichnet, daS das humane Proinsulin in humanes 
25 Insulin umgewandelt wird. 

Revendications 

1. Fragment d'ADN compose de motifs codant pour un produit desire de sequence d'acides amines 
30 connue lies en tandem par des oligonucleotides codant pour une sequence d'acides amines clivable 

selectivement et n'ayant pas de codons pour des combinaisons operatoires de signaux d'arrit et de depart 
entre les extremites dudit fragment, ce qui fait que le fragment peut etre exprim6 dans un vehicuie 
approprie pour produire un polypeptide compose de motifs lies en tandem dudit produit desire. 

2. Fragment selon la revendication 1 dans un vehicuie de colonage replicable capable d'exprimer le 
35 fragment dans un organisme microbien. r 

3. Fragment selon la revendication 1 ou la revendication 2, dans lequel ladite sequence d acides 
amines clivable selectivement comprend une liaison aspartylproline de maniere a etre selectivement 

clivable a cette liaison par des acides. ...... , 

4. Fragment selon la revendication 1 ou la revendication 2, caracterise en ce que le produit desire est la 

40 proinsuline humaine. t . m , 

5. Fragment selon I'une quelconque des revendications 1, 2, ou 4, caracterise par de 2 a 6 motifs codant 
chacun pour la proinsuline humaine et lies en tandem par des oligonucleotides codant pour des sequences 
d'acides amines selectivement clivables. 

6. Fragment selon I'une quelconque des revendications 1, 2, 4 ou 5, dans lequel ladite sequence 
45 d'acides amines selectivement clivable comprend un reste methionine de maniere a etre selectivement 

clivable a ce reste par du bromure de cyanogene. 

7. Plasmide recombinant comprenant un vecteur plasmidique et un fragment d'ADN expnmable qui y 
est lie, caracterise en ce que le fragment d'ADN est compost de motifs codant pour un produit desire de 
sequence d'acides amines connu lies en tandem par des oligonucleotides codant pour une sequence 

so d'acides amines selectivement clivable, et n'ayant pas de codons pour des combinaisons operatoires de 
signaux d'arret et de depart entre les extremites dudit fragment. 

8 Plasmide selon la revendication 7 dans lequel ladite sequence d'acides amines selectivement 
clivable encode comprend un reste methionine de maniere a §tre selectivement clivable a ce reste par du 

brumure de cyanogene. . . , . 

55 9 Plasmide selon la revendication 7 caracterise en ce que le produit desire est la proinsuline humaine, 
en ce que le fragment d'ADN est niuni au cot6 3' d'un fragment d'operon lac encodant les environ 80 
premiers acides amines de la p-galactosidase, et le fragment contient de 2 a 5 sequences 
d'oligonucleotides codant pour la proinsuline humaine liees en tandem. 

10 Plasmide selon la revendication 7 caracterise en ce que le produit desird est la proinsuline humaine, 
60- le fragment d'ADN est directement lie a un fragment du vecteur de clonage contenant le promoteur Tac 

suivi par une sequence bacterienne de Shine-Dalgarno, et le fragment contient de 3 a 6 sequences 
d'oligonucleotides codant pour la proinsuline humaine liees en tandem. 

11 Plasmide selon I'une quelconque des revendications 7, 9 ou 10 dans lequel ladite sequence d acides 
amines selectivement clivable encode comprend un reste methionine de maniere a etre selectivement 

65 clivable a ce reste par du bromure de cyanogene. 
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12. Microorganisme qui a ete transforme par le plasmide revendique dans I'une quelconque des 
revendications 7 S 11 pour Texpression dudit fragment d'ADN. 

13. Procede de production de polypeptides par des microorganismes, caracterise en ce que: 
(a) un dive un vecteur en un site de restriction predetermine 

5 (b) on prepare un insert pour le vecteur dive incluant un fragment d'ADN qui a des exemplaires 
multiples du gene pour ie polypeptide separe par des sequences d'oligonucleotides codant pour des 
sequences d'acides amines selectivement clivables, et qui n'a pas de codons pour des combinaisons 
operatoires de signaux d'arret et de depart entre les extremites dudit fragment; 

(c) on lie Tinsert au vecteur dive pour former un plasmide recombinant contenant Tinsert; 

10 (d) on transforme un microorganisme avec ie plasmide recombinant; 

(e) on cultive ledit microorganisme dans un milieu de culture pour effectuer ('expression d'un fragment 
d'ADN contenant lesdits exemplaires multiples de ta sequence de codage pour le polypeptide; 

(f) on isole du microorganisme une fraction constituent un polymere compose de multiples 
exemplaires lies au polypeptide; et 

is (g) on dive le polypeptide a exemplaires multiples isole en polypeptides uniques. 

14. Precede selon la revendication 13, caracterise en ce que I'etape if) comprend une rupture par 
ultrasons des cellules du microorganisme separees du mileu de culture. 

15. Proced6 de production de polypeptides a partir d'un microorganisme selon la revendication 12, 
caracterise en ce qu'on cultive ledit microorganisme dans un milieu de culture pour effectuer ['expression 

20 d'un fragment d'ADN contenant lesdits exemplaires multiples de la sequence de codage pour le 
polypeptide et pour produire un polymere compose de multiples exemplaires lies au polypeptide; et en ce 
qu'on dive !e polymere pour produire ledit polypeptide. 

16. Procede selon I'une quelconque des revendications 13 a 15 caracterise en ce que le polypeptide est 

la proinsuline humaine. 

25 17. Procede selon la revendication 16, caracterise en ce que la proinsuline humaine est transformee en 
insuline humaine. 
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